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Description 

[CYCLIC REDUNDANCY CHECK 
GENERATION CIRCUIT] 

Background of Invention 
[0001] FIELD OF THE INVENTION 

[0002] The present invention relates to the field of cyclic redun- 
dancy check circuits; more specifically, it relates to a fully 
pipelined cyclic redundancy check circuit. 

[0003] BACKGROUND OF THE INVENTION 

[0004] Error checking of data transmissions between sending and 
receiving devices use a cyclic redundancy check circuit 
(CRC) implementing various CRC codes in both the send- 
ing and receiving devices. The CRC code is calculated by 
an exclusive OR (XOR) subtree. As high speed serial inter- 
connect technologies evolve, many of the standards gov- 
erning these technologies allow bandwidths well beyond 
the traditional 96 and 128 bits per cycle bandwidths, yet 
maintain the same transmission frequency as for the older 



smaller 96 and 128 bits per cycle bandwidths. As band- 
width increases, the complexity and depth of the XOR 
subtree must increase as the need to process more bits 
per clock cycle grows. Traditional CRC designs when ap- 
plied to large bandwidth data transmissions very quickly 
develop the interrelated problems of increased processing 
time and physical silicon area required to implement the 
XOR subtree. Therefore, there is a need for a more effi- 
cient CRC circuit than presently available. 
Summary of Invention 

[0005] a first aspect of the present invention is a circuit, com- 
prising: multiple W-bit packet data slice latches each hav- 
ing inputs and outputs, the packet data slice latches con- 
nected in series from a first to a last packet data slice 
latch, outputs of a previous packet data slice latch con- 
nected to inputs of an immediately subsequent packet 
data slice latch; a data partition comprising multiple data 
XOR subtree levels and having data latches between the 
data XOR subtree levels, the data partition having inputs 
and outputs, the outputs of each packet data slice latch 
connected to corresponding inputs of the data partition; a 
remainder partition comprising multiple remainder XOR 
subtree levels and having remainder latches between the 



remainder XOR subtree levels, the remainder partition 
having inputs and outputs; a combinatorial XOR tree hav- 
ing inputs and outputs, the outputs of the remainder par- 
tition and the outputs of the data partition connected to 
corresponding inputs of the combinatorial XOR tree; and 
an M-bit current cyclic redundancy check (CRC) remainder 
latch having inputs and outputs, the output of the combi- 
natorial XOR tree connected to corresponding inputs of 
the current CRC remainder latch and the outputs of the 
current CRC remainder latch connected to corresponding 
inputs of the remainder partition. 
[0006] a second aspect of the present invention is a method, 
providing multiple W-bit packet data slice latches each 
having inputs and outputs, the packet data slice latches 
connected in series from a first to a last packet data slice 
latch, outputs of a previous packet data slice latch con- 
nected to inputs of an immediately subsequent packet 
data slice latch; providing a data partition comprising 
multiple data XOR subtree levels and having data latches 
between the data XOR subtree levels, the data partition 
having inputs and outputs, the outputs of each packet 
data slice latch connected to corresponding inputs of the 
data partition; providing a remainder partition comprising 



multiple remainder XOR subtree levels and having remain- 
der latches between the remainder XOR subtree levels, the 
remainder partition having inputs and outputs; providing 
a combinatorial XOR tree having inputs and outputs, the 
outputs of the remainder partition and the outputs of the 
data partition connected to the inputs of the combinato- 
rial XOR tree; and providing an M-bit current cyclic redun- 
dancy check (CRC) remainder latch having inputs and out- 
puts, the output of the combinatorial XOR tree connected 
to the inputs of the current CRC remainder latch and the 
outputs of the current CRC remainder latch to the inputs 
of the remainder partition. 
[0007] a third aspect of the present invention is a method of de- 
signing a circuit, the method comprising: (a) providing a 
cyclic redundancy check (CRC) circuit design for a current 
CRC remainder, comprising: outputs of a packet data slice 
latch connected to inputs of a data XOR tree; outputs of a 
current CRC remainder latch connected to inputs of a re- 
mainder XOR tree; and outputs of the data XOR tree and 
outputs of the remainder XOR tree coupled to corre- 
sponding inputs of the current CRC remainder latch 
through a combinatorial XOR tree; (b) substituting a previ- 
ous CRC cycle data and corresponding previous CRC re- 



mainder for the current CRC remainder or for a previously 
subsituted CRC remainder and adding an additional 
packet data slice latch, an additional CRC remainder latch, 
an additional data XOR tree, an additional remainder XOR 
tree and an additional combinatorial XOR tree to the CRC 
circuit design without altering the a CRC remainder result 
of the CRC circuit design; (c) partitioning all packet data 
slice latches and all data XOR trees into a data partition 
and all additional current CRC remainder latches and all 
remainder XOR trees into a remainder partition; (d) com- 
bining all remainder XOR trees into a single remainder 
XOR tree and combining all data XOR trees into a single 
data XOR tree; (e) repeating steps (b) through (c) a prede- 
termined number of times; and (f) distributing the single 
remainder XOR tree in the remainder partition over two or 
more remainder XOR subtree levels, distributing all addi- 
tional CRC remainder latches over one or more and re- 
mainder latch levels, distributing the single data XOR 
trees in the data partition over two or more data XOR sub- 
tree levels and distributing all additional packet data slice 
latches over one or more data latch levels. 
[0008] a fourth aspect of the present invention is a method of 
designing a circuit, the method comprising: (a) distribut- 



ing a current cyclic redundancy check (CRC) remainder 
XOR calculation of an M-bit redundancy check circuit into 
a remainder partition comprising multiple levels of re- 
mainder XOR subtrees and having remainder latches be- 
tween the levels of remainder XOR subtrees; and (b) dis- 
tributing a packet data slice XOR function of the M-bit re- 
dundancy check circuit into a data partition of comprising 
multiple levels of data XOR subtrees and having data 
latches between the levels of data XOR subtrees. 
Brief Description of Drawings 

[0009] The features of the invention are set forth in the ap- 
pended claims. The invention itself, however, will be best 
understood by reference to the following detailed descrip- 
tion of an illustrative embodiment when read in conjunc- 
tion with the accompanying drawings, wherein: 

[0010] FIG. 1 is an exemplary m-bit CRC circuit; 

[0011] FIGs. 2 through 7 illustrate a set of logical steps applied 
to CRC circuit 100 of FIG. 1 in developing a CRC circuit of 
the present invention; 

[0012] FIG. 8 is an exemplary 32-bit CRC circuit according to the 
present invention; 

[0013] FIG. 9 is a schematic circuit diagram of a remainder parti- 



tion of the CRC circuit of FIG. 8; 

[0014] FIG. 10 is a schematic circuit diagram of a data partition 
of the CRC circuit of FIG. 8; 

[0015] FIG. 11 is an exemplary generic scalable M-bit CRC circuit 
according to the present invention; 

[0016] FIG. 12 is a schematic circuit diagram of a remainder par- 
tition of the CRC circuit of FIG. 11; 

[0017] FIG. 13 is a schematic circuit diagram of a data partition 
of the CRC circuit of FIG. 11; 

[0018] FIG. 14 is a flowchart of the method of designing a re- 
mainder partition of a CRC circuit according to the present 
invention; and 

[0019] FIG. 15 is a flowchart of the method of designing a data 
partition of a CRC circuit according to the present inven- 
tion. 

Detailed Description 

[0020] The present invention is related to pending Patent Appli- 
cation S/N 10/729,277 filed on December 4, 2003 which 
is hereby incorporated by reference in its entirety. 

[0021] The terminology Q by P-way XOR subtree defines an XOR 

subtree having Q outputs and (PxQ) inputs. The notation 
p 

QAP should be read as Q . Pipelining is a method of pro- 
cessing on a computer that allows fast parallel processing 



of data. In a fully pipelined CRC circuit, this means parallel 
processing of both the packet data slice and the CRC re- 
mainder simultaneously. The term level may be read as 
stage. 

[0022] An XOR operation is defined herein and in the claims as a 
logical operation over an arbitrary number of binary in- 
puts with a single binary output. The logical operation re- 
sults in a logical 1 when an odd number of inputs are log- 
ical ones and the rest of the inputs are logical zero, else 
the output is a logical zero. An XOR gate is defined herein 
and in the claims as a circuit that implements an XOR op- 
eration. An XOR level is defined herein and in the claims 
as an acyclic (not cyclic) network of XOR gates with an ar- 
bitrary number of binary inputs and an arbitrary number 
of outputs. The inputs of the XOR gates are connected to 
outputs of a first set of latches. The outputs of the XOR 
gates are connected to inputs of a second set of latches. 

[0023] FIG. 1 is an exemplary m-bit CRC circuit. In FIG. 1, a CRC 
circuit 100 includes an m-bit packet data slice latch 105 
(latching the cycle j data packet slice), a single, m-bit in- 
put/n-bit output data XOR tree 110 and an n-in- 
put/n-output remainder XOR tree 115, a 
2n-input/n-output combinatorial XOR tree 120 



(generating the cycle j + 1 remainder) and an n-bit current 
CRC remainder latch 125 (latching the cycle j remainder). 
The bit width of remainder latch 125 defines the CRC 
type, in the present example an n-bit CRC. The outputs of 
packet slice latch 105 are connected to the inputs of data 
XOR tree 110 by an m-bit bus; each bit connected to a 
different input. The outputs of data XOR subtree 110 are 
connected a first set of n-inputs of combinatorial XOR 
tree 120 by an n-bit bus, each bit connected to a different 
input. The outputs of remainder XOR subtree 115 are 
connected to a second set of n-inputs of combinatorial 
XOR tree 120 by an n-bit bus, each bit connected to a dif- 
ferent input. The outputs of combinatorial XOR subtree 
120 are connected to the inputs of current CRC remainder 
latch 115 by an n-bit bus, each bit connected to a differ- 
ent input. The outputs of current CRC remainder latch 115 
are connected to the inputs of remainder XOR tree 115 by 
an n-bit bus, each bit connected to a different input, thus 
providing the cyclic portion of the CRC result. 
[0024] D ata D j ts are moved from packet data slice latch 105 and 
through data XOR tree 110 and combinatorial XOR tree 
120 into current CRC latch 125 by a clock signal CLK. Re- 
mainder bits are cycled from current CRC remainder latch 



125, through remainder XOR tree 115 and combinatorial 
XOR tree 120 and back to the current CRC remainder latch 
by clock signal CLK. The arrangement of XOR gates in XOR 
tree 110 implements the CRC code and performs the ac- 
tual CRC calculation. 

[0025] As the number of input bits to an XOR tree increases, the 
depth of XOR gates (the number of XOR gates connected 
in series from the input to the output of the XOR tree ) as 
well as the number of inputs in each individual XOR gate 
in the XOR tree increases. At some point, it will take more 
than a single clock cycle for data bits to travel through the 
data XOR tree and remainder bits to travel through the re- 
mainder XOR tree and the CRC circuit will generate an er- 
roneous CRC result. The present invention avoids XOR 
tree data bit propagation time problems by partitioning 
both the data XOR tree and the remainder XOR-tree into 
levels, each level small enough not to have a data bit 
propagation time problem and to avoid the data bit path 
or the remainder bit path gating the CRC operation. It 
should be noted that data bit and remainder bit propaga- 
tion time is dependent on the integrated circuit technol- 
ogy that the CRC circuit is physically fabricated in. 

[0026] The first partition is a set of XOR subtrees and latches for 



processing the data bits and the second is a set of XOR 
subtrees and latches for processing the remainder bits of 
the CRC. Both partitions are multi-level partitions, each 
level comprised of multiple XOR subtrees and latches. 
Each XOR subtrees of the data partition is no slower than 
the slowest XOR subtree in the remainder partition Each 
level of XOR subtrees perform a portion of the CRC calcu- 
lation and each XOR subtree belonging to a particular 
level performs a portion of the portion of the CRC calcula- 
tion performed by the level. The size of the largest re- 
mainder subtree is chosen so that all the XOR calculation 
it performs can be completed in one clock cycle at the de- 
sired frequency. Since all the XOR subtrees of the data 
partition and the remainder partition are no slower than 
the slowest remainder XOR subtree, each data partition 
levels portion of the CRC is likewise performed in one 
clock cycle or less. 
[0027] piGs. 2 through 7 illustrate a set of logical steps applied 
to CRC circuit 100 of FIG. 1 in developing a CRC circuit of 
the present invention. The remainder for any given cycle is 
the result of the same set of XOR operations performed on 
the previous remainder and previous data. In FIG. 1, the 
remainder for cycle j + 1 was the result of XOR operations 



on the data packet slice for cycle j and the remainder for 
cycle j. The remainder for cycle j can be expressed as the 
same XOR operations performed on the data packet slice 
for cycle j-1 and the remainder for cycle j-1. An XOR op- 
eration is in reality a mathematical calculation and just as 
mathematical calculations make use of substitutions, the 
XOR tree operations can be substituted. 

[0028] Thus in FIG. 2, the j-1 cycle has been substituted for the j 
cycle to create a CRC circuit 100A which is functionally 
identical to CRC circuit 100 of FIG. 1, though its operation 
might be slightly slower. However, in FIG. 2, an m-bit 
packet data slice latch 130 (latching the cycle j-1 data 
packet slice), an m input/n-output data XOR tree 135, an 
n-bit current CRC remainder latch 140 (latching the cycle 
j-1 remainder), an m-input/n-output remainder XOR tree 
and a 2n-input/n-output combinatorial XOR tree 150 
have been added. Data XOR trees 110 and 135 have about 
the same XOR gate count and remainder XOR trees 115 
and 145 have about the same XOR gate count, so the 
overall XOR gate count of CRC 100A is greater than that of 
CRC circuit 100 of FIG. 1. 

[0029] interconnections in FIG. 2 are the same as in FIG. 1 except 
for the following: The outputs of current CRC remainder 



latch 125 are connected to the inputs of current CRC re- 
mainder latch 140, by an n-bit bus, each bit connected to 
a different input. The outputs of current CRC remainder 
latch 140 are connected to the inputs of remainder XOR 
tree 145, by an n-bit bus, each bit connected to a differ- 
ent input. The outputs of remainder XOR tree 145 are 
connected to a first set of n-inputs of combinatorial XOR 
tree 150 by an n-bit bus, each bit connected to a different 
input. The outputs of packet data slice latch 105 are con- 
nected to the inputs of current data packet slice latch 130, 
by an m-bit bus, each bit connected to a different input 
(in addition to the previous connection to data XOR tree 
110). The outputs of packet data slice latch 130 are con- 
nected to the inputs of data XOR tree 135, by an n-bit 
bus, each bit connected to a different input. The outputs 
of data XOR tree 155 are connected to a second set of n- 
inputs of combinatorial XOR tree 150 by an n-bit bus, 
each bit connected to a different input. 
[0030] | n FIG. 3, a CRC circuit 100B is functionally identical to 

CRC circuit 100A of FIG. 2 and has nearly the same identi- 
cal XOR tree and latch elements. However, the intercon- 
nections between some elements have been rearranged 
topographically in order to cascade the XOR trees. The 



XOR function of remainder XOR tree 115 is applied to data 
XOR tree 135 of FIG. 2 to create data XOR tree 136 of FIG. 
3. In FIG. 3, combinatorial XORtreel20 and remainder 
XOR trees 115 and 145 are cascaded, combinatorial XOR 
trees 120 and 150 and data XOR trees 110 are cascaded 
between current CRC remainder latch 125 and current 
CRC remainder latch 140, combinatorial XOR trees 120 
and 150 and data XOR tree 136 are cascaded between 
current CRC remainder latch 125 and packet data slice 
latch 130, and combinatorial XOR trees 120 and 150 and 
data XOR tree 110 are cascaded between current CRC re- 
mainder latch 125 and packet data slice latch 130. Cas- 
caded is defined as serially connected levels (an XOR tree 
is a level in this context), the output of a previous level 
connected to the input of a subsequent level in the series 
of levels. 

[° 031 ] In FIG. 4, a CRC circuit 100C is functionally identical to 

CRC circuit 100B of FIG. 3 and has the same identical XOR 
tree and latch elements and connections between XOR 
tree and latch elements except remainder XOR trees 115 
and 145 of CRC 100B of FIG. 2 have been combined into a 
single remainder XOR tree 155. Remainder XOR tree 155 
is about the same size (same XOR gate count) as either 



remainder XOR tree 115 or 145 of FIG, 2. Therefore the 
gate count has been reduced in CRC 100C from that of 
CRC 100B of FIG. 3. 
[0032] | n FIG. 5, a CRC circuit 100D has been derived from CRC 
circuit 100 of FIG. 1 by the same process as CRC circuit 
100C of FIG. 4 was derived except that the j-2 cycle has 
been included. In FIG. 5, the outputs of current CRC re- 
mainder latch 125 for (latching the cycle j remainder) are 
connected to the inputs of a current CRC remainder latch 
165 (latching the cycle j-2 remainder) by an n-bit bus, 
each bit connected to a different input. The outputs of 
current CRC remainder latch 165 are connected to the in- 
puts of current CRC remainder latch 140 (latching the cy- 
cle j-1 remainder) by an n-bit bus, each bit connected to 
a different input. The outputs of current CRC remainder 
latch 140 are connected to the inputs of current CRC re- 
mainder latch 140 by an n-bit bus, each bit connected to 
a different input. The outputs of current CRC remainder 
latch 140 are connected to the inputs of an n-in- 
put/n-output remainder XOR tree 160 by an n-bit bus, 
each bit connected to a different input. The outputs of 
current CRC remainder latch 160 are connected to a first 
set of inputs of 2n-input/n-output XOR tree 120 (shown 



generating the cycle j + 1 remainder) by an n-bit bus, each 
bit connected to a different input. The gate count of re- 
mainder XOR tree is about same as that of remainder XOR 
tree 115 of FIG. 1, saving about 2/3 of the total XOR gate 
count required. In general any number of XOR trees can 
be merged with the resultant XOR tree having about the 
same number of XOR gates as any of the pre-merged XOR 
trees. 

[0033] Continuing with FIG. 5, the outputs of a packet data slice 
latch 170 (latching the cycle j-2 packet data slice) are 
connected to a first set of inputs of a 3m-input/n-output 
data XOR tree 175 by an m-bit bus, each bit connected to 
a different input. The outputs of packet data slice latch 
130 (latching the cycle j-1 packet data slice) are con- 
nected to a second set of inputs of 3m-input/n-output 
data XOR tree 175 by an m-bit bus, each bit connected to 
a different input. The outputs of packet data slice latch 
105 (latching the cycle j packet data slice) are connected 
to a third set of inputs of 3m-input/n-output data XOR 
tree 175 by an m-bit bus, each bit connected to a differ- 
ent input. The XOR gate count of data XOR tree 175 is 
about three times that of data XOR tree 110 of CRC circuit 
100 of FIG. 1. The outputs of data XOR tree 175 are con- 



nected to a second set of inputs of combinatorial XOR tree 
120 by an m-bit bus, each bit connected to a different in- 
put. 

[0034] | n FIG. 6, a CRC circuit 100E has been generated by redis- 
tribution of current CRC remainder latches 140 and 165 
and remainder XOR tree 160 of CRC circuit 100D of FIG. 5, 
into two latch/XOR subtree levels 185A and 185B of 
smaller XOR trees and remainder latches as illustrated in 
remainder partition 180. Each XOR subtree has a signifi- 
cantly smaller XOR gate count than remainder XOR tree 
160 of CRC circuit 100D of FIG. 5 and thus while CRC cir- 
cuit 100E is functionally identical to CRC circuit 100 of 
FIG. 1, the time to perform the CRC remainder calculation 
is significantly less for CRC circuit 100E. The method of 
partitioning remainder partition 180 is described infra. 
The total number of latch/XOR levels is equal to the num- 
ber of cycles substituted. For CRC circuit 100E there were 
3 cycles (j, j-1 and j-2) and there are three latch/XOR lev- 
els, 185A, 185B and 185C, latch/XOR level 185C including 
combinatorial XOR tree 120 and current CRC remainder 
latch 125. 

[0035] | n fig. 7, a CRC circuit 100F has been generated by parti- 
tioning of XOR tree 175 of CRC circuit 100E into multiple 



latch/XOR subtree levels 195A through 195N of smaller 
XOR trees and remainder latches as illustrated in data 
partition 190. Each XOR subtree has a significantly smaller 
XOR gate count than data XOR tree 175 of CRC circuit 
100E of FIG. 6 and thus while CRC circuit 100F is func- 
tionally identical to CRC circuit 100 of FIG. 1, the time to 
perform the CRC remainder calculation as well as process 
the packet data slices is significantly less for CRC circuit 
100E. The method of partitioning data partition 190 is de- 
scribed infra. The data partition XOR subtrees are de- 
signed so that no latch/XOR subtree has a greater depth 
of XOR gates (there may be multiple cascaded XOR gates 
in an XOR subtree) than the depth of XOR gates in any 
latch/XOR level in remainder partition 180. The upper 
bound of the number of XOR gates is fixed, regardless of 
the number of previous cycles extracted. For example, for 
CRC circuits implementing 32 CRC calculations, the XOR 
subtree will never have more than 32 bits. If 5 previous 
cycles are used to design the CRC circuit, then the CRC 
circuit can be implemented with just a single XOR2 gate in 
each level of the remainder partition. 
[0036] An initial value must for the current CRC remainder must 
be supplied. Given the CRC circuit structures described 



supra, in reference to FIGs. 6 and 7, the CRC remainder for 
cycle j = l is determined by the value of the packet data 
slice for cycle j, the value of the packet data slice for cycle 
j-1 through the value of the packet data slice for cycle j- 
(a-1) where a is the number of cycles the CRC circuit is 
based on. In the example of CRC circuits 100E and 100F 
of FIGs. 6 and 7 respectively, a=3. 
[0037] The first a-1 cycles of packet data slices will require 

knowing the CRC remainders for cycles prior to the initial 
(0) cycle. The CRC remainder of the initial cycle is a pre- 
determined value, in one example all l"s. CRC remainders 
of cycles prior to the initial cycle can be computed by ar- 
bitrarily picking data slice values prior to the initial cycle, 
for example, all 0"s, and computing what CRC remainder 
values would be required for cycles -1 to -(a-1) to pro- 
duce a cycle 0 remainder of the predetermined initial 
value. This can be accomplished by in a software program 
by inverting the function that computes CRC remainders 
serially (i. e. one bit at a time). With the inverted function, 
the CRC remainder is set to the initial value, and a set of w 
0-bits are fed into the inverted function. This computes 
the CRC remainder for cycle -1. For each set of w 0-bits 
fed into the inverted function, another remainder cycle 



value is computed. 

[0038] once the initial CRC remainder values have been com- 
puted, the XOR operations in the remainder partition must 
be applied to the values such that for a given latch level 
a-c, (c=the cycle of interest) all XOR operations starting at 
the current CRC remainder latch for level 0 and level a-c 
are applied to the initial CRC value for cycle -c. An exam- 
ple of a PERL program for performing (for a=3) the afore- 
mentioned calculations is given in Table I. 

[0039] TABLE 1 

[0040] PROGRAM FOR CALCULATING INITIAL CRC REMAIN- 
DER*! /usr/ bin/ perl 
[0041] $num_cycles_back = 3; 

[0042] $num_bits_per_cycle = 192; 

[0043] for ($i = 0; $i <32; $i = $i +1){ 

[0044] $init_y[$i] = "1"; 

[0045] } 

[0046] f or ($j = 0; $j < $num_cycles_back; $j ++) { 

[0047] f or ($j = o; $i < $num_bits_per_cycle; $i ++) { 

[0048] #REVERSE THE LFSR OPERATION 



[0049] 


$tmp_y[30] 


= 


$init_y[31]; 


[0050] 


$tmp_y[29] 


= 


$init_y[30]; 


[0051] 


$tmp_y[28] 


= 


$init_y[29]; 


[0052] 


$tmp_y[27] 


= 


$init_y[28]; 


[0053] 


$tmp_y[26] 


= 


$init_y[27]; 


[0054] 


$tmp_y[25] 


= 


eval_xor_it($init_y[26], $init_y[0]); 


[0055] 


$tmp_y[24] 


= 


$init_y[25]; 


[0056] 


$tmp_y[23] 


= 


$init_y[24]; 


[0057] 


$tmp_y[22] 


= 


eval_xor_it($init_y[23], $init_y[0]); 


[0058] 


$tmp_y[21] 


= 


eval_xor_it($init_y[22], $init_y[0]); 


[0059] 


$tmp_y[20] 


= 


$init_y[21]; 


[0060] 


$tmp_y[19] 


= 


$init_y[20]; 


[0061] 


$tmp_y[18] 


= 


$init_y[19]; 


[0062] 


$tmp_y[17] 


= 


$init_y[18]; 


[0063] 


$tmp_y[16] 


= 


$init_y[17]; 


[0064] 


$tmp_y[15] 


= 


eval_xor_it($init_y[16], $init_y[0]); 


[0065] 


$tmp_y[14] 




$init_y[15]; 



[0066] 


$tmp_y[13] : 


= $init_y[14]; 


[0067] 


$tmp_y[12] : 


= $init_y[13]; 


[0068] 


$tmp_y[ll] : 


= eval_xor_it($init_y[12], $init_y[0]); 


[0069] 


$tmp_y[10] : 


= eval_xor_it($init_y[ll], $init_y[0]); 


[0070] 


$tmp_y[9] = 


eval_xor_it($init_y[10], $init_y[0]); 


[0071] 


$tmp_y[8] = 


$init_y[9]; 


[0072] 


$tmp_y[7] = 


eval_xor_it($init_y[8], $init_y[0]); 


[0073] 


$tmp_y[6] = 


eval_xor_it($init_y[7], $init_y[0]); 


[0074] 


$tmp_y[5] = 


$init_y[6]; 


[0075] 


$tmp_y[4] = 


eval_xor_it($init_y[5], $init_y[0]); 


[0076] 


$tmp_y[3] = 


eval_xor_it($init_y[4], $init_y[0]); 


[0077] 


$tmp_y[2] = 


$init_y[3]; 


[0078] 


$tmp_y[l] = 


eval_xor_it($init_y[2], $init_y[0]); 


[0079] 


$tmp_y[0] = 


eval_xor_it($init_y[l], $init_y[0]); 


[0080] 


$tmp_y[31] : 


= $init_y[0]; 


[0081] 


# last bit is serial out, must be 1 


[0082] 


for($k = 0; $k < 32; $k = $k +1){ 



[0083] $init_y[$k] = $tmp_y[$k]; 

[0084] } 

[0085] } 

[0086] for ($| = 0; $1 < 32; $! + +){ 

[0087] $initial_val[$j][$l] = $init_y[$l]; 

[0088] } 

[0089] } 

[0090] f or ($ m = 0; $m < $num_cycles_back; $m++) { 

[0091] for ($i = 0; $i < 32; $i ++){ 

[0092] $init_rev_y[31 - $i] = $initial_val[$m][$i]; 

[0093] } 

[0094] $init_val = join "", @init_rev_y; 

[0095] $init_val = oct("0b$init_val"); 

[0096] P rintf("%0d cycles back: %0x\n'\($m+l),$init_val); 

[0097] } 

[0098] sub eval_xor.it { 

[0099] $ op i = shift(@_); 



[0100] $ 0P 2 = shift(@_); 

[0101] if ($opl eq "0"){ 

[0102] return $ op 2; 

[0103] } e | se { 

[0104] return ($op2 eq ,, 1")?"0 ,, : ,, 1"; 

[0105] } 
[0106] } 

[0107] FIG. 8 is an exemplary 32-bit CRC circuit according to the 
present invention. In FIG. 8, a CRC circuit 200 includes a 
32-bit current CRC remainder latch 205, a 32-bit remain- 
der partition 210, a 2048-bit packet data slice partition 
215 and a 32 by 2-way combinatorial XOR tree 220. The 
outputs of current CRC remainder latch 205 are connected 
to the inputs of remainder partition 210 by a 32-bit bus, 
each bit connected to a different input. The outputs of re- 
mainder partition 210 are connected to a first set of in- 
puts of combinatorial XOR tree 220 by a 32-bit bus, each 
bit connected to a different input. The outputs of packet 
data slice partition 215 are connected a second set of in- 
puts of combinatorial XOR tree 220 by a 32-bit bus, each 
bit connected to a different input. The outputs of combi- 



natorial XOR tree 220 are connected to the inputs of cur- 
rent CRC remainder latch 205 by a 32-bit bus, each bit 
connected to a different input. 

[0108] FIG. 9 is a schematic circuit diagram of remainder parti- 
tion 210 of CRC circuit 200 of FIG. 8. In FIG. 9, remainder 
partition 210 includes nine 32 by (0-3) XOR subtrees 225 
arranged in sets of 3, and corresponding 32-bit latches 
230, three sets of 32 by 3-way XOR subtrees 235 and 
corresponding 32-bit latches 240, and a 32 by 3-way XOR 
subtree 245. Latches 225are partition level 0 latches, and 
latches 240 are partition level 1 latches. 

[0109] Each XOR subtree 225 is connected to current CRC re- 
mainder CRC latch 205 (see FIG. 8) by 0 to 3, 32-bit in- 
puts. Each of the 32 outputs of each XOR subtree 225 is 
connected to a different input of a corresponding latch 
215. There need not be any particular relationship be- 
tween a particular input of a particular XOR subtree 225 
and a particular bit from remainder CRC latch 205 (see 
FIG. 8). Each of the 32 outputs of each latch 230 in each 
set is connected to a different input of a different 32 input 
set of the 96 inputs of a corresponding XOR subtree 235. 
Each of the 32 outputs of each XOR subtree 240 is con- 
nected to a different input of a 32 input set of the 96 in- 



puts of XOR subtree 235. Each of the 32 outputs of each 
XOR subtree 235 is connected to a different input of a 
corresponding latch 245. The 32 outputs of each latch 
240 are connected to a different input of a32 input set of 
the 96 inputs of XOR subtree 245. 
Remainder bits are moved from remainder CRC latch 205 
(see FIG. 8) through XOR subtrees 225 into latches 230 by 
clock signal CLK. Remainder bits are moved from latches 
230 through XOR subtrees 235 and into latches 240 by 
clock signal CLK. Remainder bits are moved from latches 
240, through XOR subtree 245 into combinatorial XOR 
tree 220 (see FIG. 8) by clock signal CLK. The specific ar- 
rangement of XOR gates in XOR trees 210, 235 and 225 
implements the CRC code and performs the actual CRC 
calculation. 

[° 1 1 1 ] FIG. 10 is a schematic circuit diagram of data partition 

215 of CRC circuit 200 of FIG. 8. In FIG. 10 data partition 
215 includes three 2048-bit packet data slice latches 250, 
251 and 252, three sets of 32 by (0 to 3)-way XOR sub- 
trees 260 and corresponding 32-bit latches 265, inter- 
vening latch and XOR levels 2 through 5, three 32 by 
3-way XOR subtrees 270 and corresponding 32-bit 
latches 275, and a 32 by 3-way XOR subtree 280. Latches 



265 are partition level 1 latches, and latches 275 are par- 
tition level 7 latches, so there are eight latch levels in data 
partition 215. 

[° 112 ] Each XOR subtree 260 is connected to packet data slice 

latch 250, 252 or 252 by 0 to 3, 32-bit inputs (i. e. 96 in- 
puts to each XOR subtree). Each of the 32 outputs of each 
XOR subtree 260 is connected to a different input of a 
corresponding latch 265. There need not be any particular 
relationship between a particular input of a particular XOR 
subtree 270 and a particular bit from packet data slice 
latch 250, 251 or 252. Each of the 32 outputs of each 
latch 265 of each set of 8 latches 265 is connected to a 
different input of a corresponding XOR subtree 270. Each 
of the 32 outputs of each latch 275 is connected to a dif- 
ferent input of XOR subtree 280. 

[0113] D a ta bits are moved from packet data slice latches 250, 
251 and 252 through XOR subtrees 260 into latches 265 
by clock signal CLK. Data bits are moved from latches 265 
through XOR subtrees 270 and into latches 275 by clock 
signal CLK. Data bits are moved from latches 275, 
through XOR subtrees 280 into combinatorial XOR tree 
220 (see FIG. 8) by clock signal CLK. The specific arrange- 
ment of XOR gates in XOR trees 260, 270 and 280 imple- 



merits the CRC code and performs the actual CRC calcula- 
tion. 

[° 114 ] Returning to FIG. 8, the structure of data partition 215 is 
determined by maximum delay through any level of re- 
mainder partition 210. For example, if the fastest level of 
remainder partition 210 is implemented using only 
3-input and 2-input XOR gates and the largest CRC re- 
mainder expected is 1059-bits for each of latches 250, 
251 and 252 then the maximum size of a subset of the 
32-bit CRC remainder is 20-bits. The value 1059 is spe- 
cific to the particular CRC calculation and number of bits 
processed per CLK cycle. The value 20 is also determined 
by the particular CRC calculation as are the particular the 
bits of the 32-bit input to remainder partition. When par- 
titioning data partition 215, each level must include 
greater than a 3-input XOR operation. To process 
2048-bits of data in one clock cycle, the worst-case sin- 
gle XOR operation must operate on 3*1059 = 3177 bits. 

[0115] a data packet's 32-bit CRC remainder is calculated by ini- 
tializing CRC 200 to a value of OxFFFF.FFFF, and then 
processing the packet through the CRC circuit. Given the 
current CRC remainder value and a 2048-bit slice of the 
data packet, the next CRC remainder is calculated and 



then latched. The next CRC remainder value is calculated 
by performing a bit wise XOR operation on the two 32-bit 
outputs of data partition 215 and remainder partition 210. 
Each bit of the output of remainder partition 210 is calcu- 
lated by performing an XOR operation over a subset of 
bits of the current CRC remainder value. Each bit of the 
output data partition 215 is calculated by performing an 
XOR operation over a subset of bits of the portion of 
packet data currently being processed. 
16 ] The output of both remainder partition 210 and data par- 
tition 215 are the result of several levels of XOR opera- 
tions. The topmost XOR operation of data partition 215 
are the result of several levels of XOR operations. The 
topmost operation of data partition 215 (that performed 
by XOR subtree 280, se FIG. 10) is picked such that each 
output is fed by an XOR operation on three sets of inputs. 
The remaining, lower XOR operation level sizes (those 
performed by XOR subtrees 260 an by XOR subtrees 270, 
see FIG. 10) are picked arbitrarily to balance level sizes 
across the bottom two levels. There are 3216 levels total 
in data partition 215 (3 7 = 2187 level 0 levels, 3 6 = 729 
level 1 levels, 3 5 = 245 level 2 levels, 2 4 = 81 level 3 lev- 
els, 3 3 = 27 level 4 levels, 3 2 =9 level 5 levels, 3 level 6 



levels, and 1 level 7 level). The output of each sub- 
partition except for the last level, is latched. When the last 
2048-bits of a data packet are processed, the next CRC 
remainder is the CRC value for the packet. 

[0117] FIG. 11 is an exemplary generic scalable M-bit CRC circuit 
according to the present invention. In FIG. 11, a CRC cir- 
cuit 300 includes an M-bit current CRC remainder latch 
305, an M-bit remainder partition 310, a W-bit packet 
data slice partition 315 and an M by 2-way combinatorial 
XOR tree 320. The outputs of current CRC remainder latch 
320 are connected to the inputs of remainder partition 
310 by an M-bit bus, each bit connected to a different in- 
put. The outputs of remainder partition 310 are connected 
to a first set of inputs of combinatorial XOR tree 320 by 
an M-bit bus, each bit connected to a different input. The 
outputs of packet data slice partition 315 are connected a 
second set of inputs of combinatorial XOR tree 320 by an 
M-bit bus, each bit connected to a different input. The 
outputs of combinatorial XOR tree 320 are connected to 
the inputs of current CRC remainder latch 305 by an M-bit 
bus, each bit connected to a different input. 

[0118] FIG. 12 is a schematic circuit diagram of remainder parti- 
tion 325 of CRC circuit 300 of FIG. 11. In FIG. 12, remain- 



der partition 310 includes, B of M by B-way XOR subtrees 
325 in the lowest remainder of XOR subtree level and cor- 
responding M-bit latches 330, intermediate levels of M by 
B-way XOR subtrees and corresponding latches (not 
shown), B 2 of M by B-way XOR subtrees 335 and corre- 
sponding M-bit latches 340, B of M by B-way XOR sub- 
trees 345 and corresponding M-bit latches 350, and an M 
by B way XOR subtree 355 in the highest remainder XOR 
subtree level. A is the number of cascaded XOR levels in 
the remainder partition, B is the maximum number of XOR 
operations to be performed in a single XOR level and M is 
maximum number of bits in a CRC remainder. Latches 
330 are partition level 1 latches, latches 330 are level 
(A-l) latches and latches 350 are level A latches, so there 
are A partition levels in remainder partition 310. 
1Q ] Each XOR subtree 325 is connected to remainder latch 
305 by variable numbers of M-bit input. Each of the M 
outputs of each XOR subtree 325 is connected to a differ- 
ent input of a corresponding latch 330. There need not be 
any particular relationship between a particular input of a 
particular XOR subtree 325 and a particular bit from 
packet data slice latch 305 (see FIG. 11). After progressing 
through intermediate partition levels, each of the M out- 



puts of each of XOR subtrees 335 is connected to a differ- 
ent input of corresponding latches 340. Each of the M 
outputs of each latch 340 is connected to a different input 
of corresponding XOR subtrees 345. Each of the M out- 
puts of XOR subtrees 345 are connected a different input 
of corresponding latches 350. Each of the M inputs of 
latches 350 is connected to different inputs of XOR sub- 
tree 355. Each of the M outputs of XOR subtree 355 is 
connected to a different input of a first M member subset 
of the 2M inputs of combinatorial XOR subtree 305 (see 
FIG. 11). 

[0120] Remainder bits are moved from CRC remainder latch 305 
(See FIG. 11) through the level levels by a clock signal CLK 
applied to the latches within each partition level. 

[0121] The structure of remainder partition 310 is determined by 
A, I, the maximum size of a subset of the M-bit CRC re- 
mainder and B. The value I is specific to the particular CRC 
calculation and number of bits processed per CLK cycle. A 
also satisfies the relationship that A is the smallest whole 
positive number greater than log I. 

B 

[0122] FIG. 13 is a schematic circuit diagram of data partition 
370 of CRC circuit 300 of FIG. 11. In FIG. 13, data parti- 
tion 370 includes W-bit packet data slice latches 360 



through 361 and 362 through 362, B Y of M by B-way XOR 
subtrees 365 in the lowest data XOR subtree level and 
corresponding M-bit latches 370 (B was defined supra and 
Y and M are defined infra), intermediate levels of M by B- 
way XOR subtrees and corresponding latches (not shown), 
B 2 of M by B-way XOR subtrees 375 and corresponding 
M-bit latches 380, B of M by B-way XOR subtrees 385 and 
corresponding M-bit latches 390, and, an M by B-way 
XOR subtree 395 in the highest data XOR subtree level. 
Packet data slice latches 360 through 361 and 361 
through 362 are partition level 0 latch. Latches 370 are 
partition level 1 latches, latches 380 are partition level 
(Y-l) latches and latches 390 are partition level Y latches, 
so there are Y partition levels in data partition 315. 
[0123] Each XOR subtree 365 is connected to packet data slice 
latches 360 through 361 or 361 through 362. Each of the 
M outputs of each XOR subtree 365 is connected to a dif- 
ferent input of a corresponding latch 370. There need not 
be any particular relationship between a particular input 
of a particular XOR subtree 365 and a particular bit from 
packet data slice latch 360. After progressing through in- 
termediate partition levels, each of the M outputs of each 
of XOR subtrees 375 is connected to a different input of 



corresponding latches 380. Each of the M outputs of each 
latch 380 is connected to a different input of correspond- 
ing XOR subtrees 385. Each of the M outputs of XOR sub- 
trees 385 are connected to a different input of corre- 
sponding latches 390. Each of the M outputs of latches 
390 is connected to different inputs of XOR subtree 395. 

[0124] D ata D j ts are moved from packet data slice latch 360 
through the (Y-l) partition levels by a clock signal CLK 
applied to the latches within each partition level. The spe- 
cific arrangement of XOR gates in the XOR subtrees of the 
various partition levels of data partition 315 and XOR 
subtrees 365 and 375 through 385 and 395 implements 
the CRC code and performs the actual CRC calculation. 

[0125] Returning to FIG. 11, the structure of data partition 315 is 
determined by maximum delay through the any XOR sub- 
tree of remainder partition 310. The maximum number of 
inputs of any XOR subtree in CRC circuit 300 is B. 

[0126] a data packet's M-bit CRC remainder is calculated by ini- 
tializing CRC circuit 300 to a value of -1, and then pro- 
cessing the packet through the CRC circuit. Given the cur- 
rent CRC remainder value and a W-bit slice of the data 
packet, the next CRC remainder is calculated and then 
latched. The next CRC remainder value is calculated by 



performing a bit wise XOR operation on the two M-bit 
outputs of XOR remainder partition 310 and data partition 
315. Each bit of the output of remainder partition 310 is 
calculated by performing an XOR operation over a subset 
of bits of the current CRC remainder value. Each bit of the 
output of data partition 315 is calculated by performing 
an XOR operation over a subset of bits of the portion of 
packet data currently being processed. 

[0127] The outputs of both of remainder partition 310 and data 
partition 315 are the results of several levels or partitions 
of XOR operations performed as illustrated in FIGs. 11 and 
12. The topmost XOR operation of data partition 315 (that 
performed by XOR subtree 395) is picked such that each 
output is fed by an XOR operation on N-inputs. The out- 
put of each level is latched. When the last W-bits of a data 
packet are processed, the next CRC remainder is the CRC 
value for the packet. 

[0128] FIG. 14 is a flowchart of the method of designing a re- 
mainder partition of a CRC circuit according to the present 
invention. In step 400A, I, the largest number of bits in a 
subset of bits of the current CRC remainder to be pro- 
cessed is determined. In step 400B, B, the maximum 
number of XOR operations to be performed in a single 



level of the remainder partition. In step 405, the number 
of levels (A) in the remainder partition are calculated using 
the formula A= smallest whole positive number greater 
than log I. In step 410, the previous A-l cycles of CRC 

B 

calculation are substituted into the current CRC remain- 
der. This is a calculation in terms of the current packet 
data slice and previous A-l data packet slices and the 
CRC remainder value from cycle j-(A-l). In step 415, the 
CRC remainder partition is leveled so no XOR operation in 
the remainder partition has more than B inputs. In step 
420, a latch is inserted between the output of previous 
XOR subtrees and subsequent XOR subtrees of the re- 
mainder partition. In step 425, the values previous CRC 
remainders and corresponding previous packet data slices 
for j=(a-l) cycles that will result in the j cycle value that 
would be the required initial CRC remainder value are cal- 
culated. 

[0129] The design of the data partition occurs after the design of 
the remainder partition through connector A and is illus- 
trated in FIG. 15 and described infra. 

[0130] FIG. 15 is a flowchart of the method of designing a data 
partition of a CRC circuit according to the present inven- 
tion and is a continuation through connector A of the 



flowchart of FIG. 14. In step 435, the largest number of 
XOR gate levels (Y-l), in the data partition is calculated 
using the formula that (Y-l) is equal to the smallest whole 
positive number greater than the log to the base B of B 
times the largest number of bits J of a subset of the W- 
bits of the packet data slice latch. In step 440, the largest 
number of XOR operations in the data partition is set to B 
so no XOR operation of the data partition no slower than 
the slowest XOR operation performed by the remainder 
partition. In step 445, the data packet partition is parti- 
tioned into XOR subtrees such that no XOR subtree of the 
data packet slice XOR subtree has more inputs then the 
number of inputs remainder partition. This number is B. In 
step 450, the XOR output of every XOR subtree in the data 
partition is latched except the topmost XOR subtree. 

[0131] Thus, the present invention provides a more efficient CRC 
circuit than presently available. 

[0132] The description of the embodiments of the present inven- 
tion is given above for the understanding of the present 
invention. It will be understood that the invention is not 
limited to the particular embodiments described herein, 
but is capable of various modifications, rearrangements 
and substitutions as will now become apparent to those 



skilled in the art without departing from the scope of the 
invention. Therefore, it is intended that the following 
claims cover all such modifications and changes as fall 
within the true spirit and scope of the invention. 



